3574 results found.
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
Various word fillings for 50 Mad Lib stories. OtherProduction Status:
Newly created-finished
Use:
Humor Generation
-
Paper title:"Judge me by my underline{size} (noun), do you?'' YodaLib: A Demographic-Aware Humor Generation Framework
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Aparna Garimella | Demographic-aware Mad Libs | /N |
Documentation:
Will be available upon paper acceptance
Written
Corpus,
Language Type:
Multilingual
Languages:
English Nepali Sinhala
Availability:
Freely Available
License:
Creative Commons Attribution Share Alike 4.0 International
Size:
approximately 18,000 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Is MAP Decoding All You Need? The Inadequacy of the Mode in Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bryan Eikema | Flores Evaluation Set | /N |
Documentation:
None
Written
Corpus,
Language Type:
Bilingual
Languages:
English German
Availability:
Freely Available
License:
Size:
5,900,000 sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Is MAP Decoding All You Need? The Inadequacy of the Mode in Neural Machine Translation
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bryan Eikema | WMT 2018 News Translation Task Data | /N |
Documentation:
None
Modality Independent
Language Modeling Tool,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
GNU GPL 3.0
Size:
15.7 KByte Production Status:
Newly created-finished
Use:
Natural Language Generation
-
Paper title:Multi-Word Lexical Simplification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Piotr Przybyła | Plainifier | /N |
Documentation:
None
Written
Corpus Tool,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-NC-SA 4.0
Size:
7059 entries Production Status:
Newly created-finished
Use:
Natural Language Generation
-
Paper title:Multi-Word Lexical Simplification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Piotr Przybyła | MWLS1 (Multi-Word Lexical Simplification dataset 1) | /N |
Documentation:
None
Written
Grammar/Language Model,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CC BY-NC-SA 4.0
Size:
1.3 GByte Production Status:
Newly created-finished
Use:
Language Modelling
-
Paper title:Multi-Word Lexical Simplification
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Piotr Przybyła | TerseBERT | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
Size:
240M sentences Production Status:
Existing-used
Use:
Machine Learning
-
Paper title:Native-like Expression Identification by Contrasting Native and Proficient Second Language Speakers
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Yugo Murawaki | L2-Reddit Corpus | /N |
Documentation:
None
Written
Multi-label classification/extraction,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
CreativeCommons
Size:
10 MByte Production Status:
Newly created-in progress
Use:
Information Extraction, Information Retrieval
-
Paper title:Retrieving Skills from Job Descriptions: A Language Model Based Extreme Multi-label Classification Framework
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Akshay Bhola | mycareersfuture_dataset | /N |
Documentation:
https://github.com/WING-NUS/JD2Skills-BERT-XMLC
Written
Corpus,
Language Type:
Bilingual
Languages:
English Spanish
Availability:
From Owner
License:
Size:
1,297,000 articles OtherProduction Status:
Existing-used
Use:
Machine Learning
-
Paper title:Event-Guided Denoising for Multilingual Relation Learning
-
Paper track:Short paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Amith Ananthram | Reuters RCV1/RCV2 Multilingual Corpus | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
English
Availability:
Freely Available
License:
OpenSource
Size:
127600 samples OtherProduction Status:
Existing-used
Use:
Document Classification, Text categorisation
-
Paper title:RANCC: Rationalizing Neural Networks via Concept Clustering
-
Paper track:Long paper/
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Housam Khalifa Bashier | AGNews dataset | /N |
Documentation:
None




